Modified Ostrogradski formulation of field theory 
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We present a method for the Hamiltonian formulation of field theories that are based on Lagrangians 
' containing second derivatives. The new feature of our formalism is that all four partial derivatives of the field 

variables are initially considered as independent fields, in contrast to the conventional Ostrogradski method, 
fT^ , where only the velocity is turned into an independent field variable. The consistency of the formalism is 

^ ' demonstrated by simple unconstrained and constrained second order scalar field theories. Its application to 

' General Relativity is briefiy outlined. 

(N ' 
O 

! 1 Introduction 



There are two main properties one usually requires a Hamiltonian to possess. First, it should be a conserved 
quantity, the energy, and second, it should generate the time evolution of the system under consideration. 
Both features are intimately related, since in relativistic theories, the energy can be viewed as the variable 
canonically conjugate to time. However, since the Hamiltonian formalism is based on a 3+1 split of spacetime, 
this relation is not always directly obvious. As a matter of fact, in classical mechanics, but also in field theory, 
the Hamiltonian is usually introduced via a Legendre transformation of the Lagrangian, and the relation to 
the energy (i.e., to the time component of the integrated stress-energy tensor, in the field theory case) is only 
. established afterwards and appears rather as a coincidence. 

^ ' Here instead, we choose to proceed the other way around, that is, we start from the expression for the 

energy and try to find a set of canonical phase space variables such that the Hamiltonian that emerges after 
substitution of those variables into the initial expression, does indeed generate the time evolution of the system. 

The stress-energy tensor for second order theories can be derived with Noether's theorem (see our review 
paper [2]) from invariance of the theory under coordinate transformations. Assume that the fields, collectively 
denoted with (p, transform as 

5ip^e^.^ + leA''^)^, (1) 

where {cr'p)\ denotes the action of the generators of the general linear group on (p. The explicit form depends 
on the scalar, vector or tensor nature of the fields (p. (In other words, dip is the Lie derivative of the field 
with respect to ^'.) Invariance under global transformations = const leads to t\ = 0, where the canonical 
stress-energy tensor is defined as 

I dC , dC , ] dC 



A= J, (7^ v.^ + j, v,^,i-5tC. (2) 
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It can further be shown (see [T or f5]) that for generally covariant Lagrangians, t\ can be brought into the 
form 



dC 



1 dC 



k,l 



2dip,t 



dC 



7n.l 



dC 



m.l 



(3) 



where the expression in the first bracket was shown to be antisymmetric in kl, and the totally symmetric part 
(in mlk) of the expression in the second bracket was shown to be zero. As a result, r'^j = identically, in 
accordance with the second Noether theorem. 

In first order theories, it is convenient to introduce four momentum variables tt^*' = dC/dip^i, and then 
perform the 3+1 split with the help of a timelike unit vector n*, which specifies the direction of the time 
evolution. The physical momentum is then given by tt = n^'^hii (and the velocities by (p^iU^). In this way, one 
can set up an explicitely covariant Hamiltonian formulation of the theory (see [3]). Throughout this paper, we 
assume rii — 6^, such that tt — t:^^' . The introduction of 7r(*) is nevertheless useful, since it allows to write the 
stress-energy tensor in the simple form Tr'^'^V.i ~ ^i^- 

Based on those considerations, it is natural, in the framework of second order theories, to introduce the 
following momenta 



dC 



(■ 



dC 



m(i) 



dips dip^kA ' 
which contain the physical momenta (i.e., along rii = Sf) 

(0) ciC / dC 

d(p dpi^k 



dC 



= p™(0) ^ 



dC 

d(p,m 



(4) 



(5) 



where the dot denotes partial time derivation. Note that we use latin letters for spacetime indices and greek 
letters for spatial indices (and zero for the time component). Further, we take the convention that expressions 
of the form dC/dtp^fi or equivalently, dC/dip^, are always to be interpreted as the component m = of the 
initial expression dC/dip,i,m- (This is very important, for instance if £ = (p^^mf'^'"^- With our convention, we 



have dC/dp>^^i^o — 2ip'^^'^, although the mixed term ip,i 



.0^ 



is actually contained twice in C.) 



The stress-energy tensor ([2]) can now be written in the form 



m{k) 



(6) 



where it is tempting to see in ■(/',„ = </',m the variable canonically conjugate to 
conserved form of the stress-energy tensor from ([3]) takes the form 



(ap)'- 



nJ(™) 



Further, the identically 



(7) 



where (aipj)'^^ = 26j(ps + [(o'(p)'^j] .j , i.e., it acts correctly on tpj in accordance with its total tensor structure, 
taking account of the additional vector index. 

The last relation holds only in generally covariant theories (e.g.. General Relativity), and is given here only 
to demonstrate how naturally one is led to the specific choice of canonical variables. For the moment, we confine 
ourselves to special relativistic theories. 

The conserved momentum is given as integral over a three dimensional hypersurface, Vi — J T^dak- For 
Hi = 5f, this is simply Vi = J r^d'^x. We see that only the physical momenta ([5]) enter that expression. In 
particular, for the field energy H — Vq, we find 



n 



C d^a 



(8) 
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where we use already the letter H although we yet have to eliminate the velocities. 

Until this point, we have done nothing but written down the canonical field energy in terms of certain 
variables. Whether or not we can indeed interpret </3,7r and iprmT^"^ as canonical phase space variables and 
whether or not Ti. generates the time evolution of those variables is still an open question. However, from the 
above relations, any different choice of variables would seem unnatural. 

The conventional Hamiltonian formulation of second order theories is based on two pairs of canonically 
conjugated variables, ip, tt and ip^ tt, with -0 = ^- This method goes back to Ostrogradski. We will investigate 
the relation between both methods at a later stage of this paper. Let us just remark that the fact that we use 
more variables should ultimately result in a theory with more constraints. If this is not the case, then both 
methods cannot be equivalent. 

Finally, we assume the following equal-time Poisson brackets 

[^{x),TT{y)]^S{x-y), [V'™(a;),/(y)] =(5f„(5(a;~2/), (9) 

and zero for any other bracket, e.g., [ip,iprn] — 0. For simplicity, we use x,y to denote the space points x^,y^, 
and 6{x — y) = 5^^\x — y) for the three dimensional delta function. The covariant form (i.e., with respect to a 
general, unspecified hypersurface) of those relations are given in !2j. Note that even classically, the relations ^ 
cannot be verified. It is indeed possible to construct the Poisson bracket such that it gives canonical relations 
for an arbitrary choice of variables. The only thing subject to verification is the resulting Hamiltonian theory. 
To this we turn now. 

In the next two sections, we will apply the above formalism to several simple second order scalar field 
Lagrangians. It turns out that in all cases, the formalism reveals itself to be consistent. In section |4l we 
briefly analyze the relation to the conventional Ostrogradski formalism. Finally, in section [5l we treat General 
Relativity as a constrained second order theory and in section [6l we briefly discuss the relation of the first class 
constraints to the diffeomorphism invariance of the theory. 



2 Unconstrained theories 

As a first example, we consider the Lagrangian 

£-i^,,^-^ + in^n^, (10) 

where we use the notation Ut^ = and Ac/? — ip''-'-^ (/i = 1, 2, 3). The field equations are UOup — 'Oup = 0. We 
do not discuss the physical relevance of such a theory (therefore, coupling constants have been omitted). The 
inclusion of a potential, in particular a mass term, is trivial and does not lead to significant modifications of 
our discussion. In what follows, we will refer to the theory based on the above Lagrangian as example I. 
The momenta are found from ([5]) in the form 

TT = iP-UiP, f^^S'^Uip. (11) 

It turns out to be convenient to simplify the notations for the time components of and ipm — 'P,m, and 
to use = p and t/iq = ^- Thus, we have ip = (p (as in the Ostrogradski formulation), as well as = V'./j- 
Those three relations do not contain velocities, and must therefore be considered as constraints. Next, we have 
p = Dip = ijj + Aip and t: — ip — p, both containing velocities, as well as p^ = 0, which are constraints again. 
As a result, we have the following 6 constraints 

$M = ^A.-'^,/^. *^=p^ (12) 
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which satisfy 

['^„^'']^s;s, (13) 

as weh as [$^, $1,] = [^'^, Vf] = 0, where we omit the arguments for simphcity of the notation. Thus, we are 
deaUng with second class constraints, which have to be dealt with by the introduction of the Dirac bracket (see 
[3]). For the specific structure it is not hard to show that the Dirac bracket can be written in the form 

[A, B]* = [A, B] + [A, [vl/^ B] - [A, vj/^] [$^, B] . (14) 

Note that apart from a summation over ^, there is also an integration over the argument of $^(2) and \l/^(z) 
involved, which is suppressed by our simplified notation. 
We now easily find the following Dirac bracket relations 

[^,A]* ^[^,A], [i^,A]* ^[i^,A], [V'^,7r]* =5^, b,^]*=0, [p^V'.]* = 0, (15) 

where A is arbitrary. As it turns out, the relations are exactly those that could have been assumed right from the 
start if one would not have considered -0^ and as canonical variables. This is rather a coincidence, however, 
as we will see in the next example. (The notation (5,^ is to be interpreted as /(5,^ = —f,iiS for a function /. The 
omission of the arguments is not without danger. Note, for instance, that from [ipfj,(x),n{y)] = 5,p(x — y), we 
find [7r(j/), "0^1(2;)] = —S,p,ix — y) = 5^ti{y — x), and thus, in short notation, [tt, 0;^] = 5^^, contrary to what could 
have been expected from the initial relation [i/;^, tt] = and the antisymmetry of the Poisson bracket.) 

In any case, we can now impose the constraints as strong relations between the variables, and thus eliminate 
0^ and as independent field variables. Then, we use ([8]) in order to write down the Hamiltonian. The 
velocities (of the remaining variables Lp and ■0) are easily expressed in terms of the momenta as ip = ip and 
■ip = p — A(p. We find 

n = + _ _ 1^2 _ 1 <^^^<^,A«)d3a;. (16) 

With the help of ([TS|), we find 

[H,(^]* = -0 = -^ (17) 

[H, tt]* = -Ap + A(/3 = -(□□(/? ~ Uip) ~{(p- up) ^ -TT, (18) 

= -P + A(/. = -V;, [n,p\* = n - i) = -p. (19) 

Thus, the Hamiltonian does indeed generate the time evolution of the fields. To conclude, despite the fact 
that additional pairs of variables (■0^,7r^) revealed themselves as irrelevant, the formalism has nevertheless 
successfully passed its first test. 

We now start form the Lagrangian 

= \^,^^-' + \v,^,n^''''^■ (20) 

This theory, which will be referred to as example II, is equivalent to the previous one in the sense that it leads to 
the same field equations. It differs, however, by a four divergence and therefore, differences in the Hamiltonian 
theory will arise. We find tt = p — Up, p™ = p'"^ , and thus, writing again ip = ipQ and p = we have 
TT ~ tp — p^ ^ — p and p = ip, which are relations involving velocities. In addition, we have the constraints 

<i>M = V'm - V'.M^ *^=P^-0''^, (21) 
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which satisfy again — [^''^,'1''^] and 

[^,,^'']^S';S. (22) 

Ahhough the constraints satisfy the same Poisson structure as in example I, they are nevertheless fundamentally 
different because of the explicit occurrence of ip. Indeed, we now find the following Dirac brackets 

[ip,AY ^[^,A], [^p,A]* ^[^,A], [V'^,^]*=<5,^, [p,nY = -AS, [p^V^.]* = (23) 

for arbitrary A. They are identical to except for the relation [p, tt]* = —AS, which is of course again a 

symbolic notation for f{A6) — {Af)6. Imposing the constraints as strong relations in 7i, we find 

n = y (tt^ + - - iy^,^^'^ - ^^,^.,^'''n<i'x, (24) 

where the velocities have been eliminated hy — p and ipp, = TTfj, = ip^fj,, as well as ip = ip. We now easily derive 

[n,ip]* = ~^^-ip (25) 

[n, tt]* = ~Ap + Aip- AAtp = -Aif + Aip + AAip = {Utp - UUip) - {(p - Uip) -tt, (26) 

[n,ip]* = -p^ -ij, [n,p]* ^ Aip + TT-i/j^-p. (27) 

Again, the formalism works perfectly well. As before, the additional variables {ip^i,,p^) could be eliminated 
after the introduction of the Dirac brackets. It is expected that this is a general feature of our formalism, if 
there is any hope for it to be equivalent to the Ostrogradski formulation based on only two pairs of variables. 
On the other hand, it should be noted that, in contrast to example I, in the present case, we could not have 
anticipated the relation [p, tt]* = —A5. If we simply ignore the variables tt'^ and -0^, then this relation cannot 
be derived, since we need the complete set of constraints pip to get the correct Dirac brackets. Finally, in order 
to avoid confusion, we should mention that in the title of this section, we use the term unconstrained in the 
sense that in the conventional Ostrogradski formulation, those theories are indeed free of constraints. In our 
modified formalism, there will always be at least those constraints that eliminate the variables and p^ . In a 
constrained theory, there will be additional constraints. 



3 Constrained theory 

We now consider the Lagrangian 

£ + a(/jn^. (28) 

This theory [example III) is equivalent to the conventional first order scalar field theory and we can thus expect 
that the application of the second order formalism leads to a constrained system. The momenta are found in 
the form tt = Lp{\ — a) = ip{\ — a) and = SQ^atp. Thus, apart from the constraints 

^^^iP^-ifi,^, Vl/A'^pA^, (29) 

we now have the additional constraints 

^=p-a(p, * = TT - (1 - a)i/', (30) 
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since none of those relations involves velocities. The constraints are all second class. Rather than deriving 
directly the Dirac brackets for the system of those eight constraints (at each point x), it is convenient to proceed 
in two steps. First, we construct the Dirac bracket that eliminates the constraints (P^ . (The construction of 
the Dirac brackets by eliminating the constraints in two steps will be given at the end of this section.) 

The constraints are identical to those of example I, and the result, as we have seen, is simply that we can 
eliminate the variables '0^ and in terms of ip^^. The Dirac brackets for the remaining variables are identical 
to the initial Poisson bracket, see ([15]). We will therefore retain the notation [A, B\. 

In a second step, we turn to the constraints which satisfy 

[$,*] = (1 - 2a)(5. (31) 
The corresponding Dirac bracket is easily shown to be of the form 

[A, B] * - [A, B] + [A, $] [VI/, S] - [A, vf] B] , (32) 

where again an integration is suppressed by our notation. We find the following relations 

[^,^V=-^—^S, [(p,p]* = [^,0]* = O, [n,p]* =-a^^ 6, [^,^]* = -i_5. (33) 
1 — 2q; 1 — 2q! 1 — 2a 

The Hamiltonian is found from ([5]) upon imposing the constraints as strong relations. The result is 

^ = / {jT^ " " " d'^. (34) 

If we rescale the momentum tt by introducing tt — tt, such that [n, ip]* = —S, we can alternatively write 

H = 1 Q ^'-^k' - d'^. (35) 

Apart from the last term, which is a surface term, the Hamiltonian corresponds to the conventional first order 
Hamiltonian derived from £ = — a)(p,i'P'^, which is equal to (|28p up to a four divergence. 
For the rest, the relations 

[n,ip]*^-ip, [n,7r]*^~7r (36) 

are easily verified. Thus, our formalism works even for such a strongly constrained system. 

The Lagrangian ()28p for the specific value a = 1 can be viewed as a toy model that mimics in some sense 
the Lagrangian of General Relativity. Namely, the Lagrangian ^/—g R consists of a part containing only first 
derivatives of the metric and a part that contains the metric and its second derivatives (linearly). The second 
part equals, up to a four divergence, the double of the opposite of the first part, similarly as in for a = 1. It 
is indeed the scope of our exercise to provide a Hamiltonian formalism that can be applied to General Relativity 
in its explicitely covariant form, in contrast to the conventional first order method, where a surface term has to 
be omitted, resulting in an effective Lagrangian that is not explicitely covariant. 

At first sight, this may look like an unnecessary complication, since the number of variables is initially 
increased only to be reduced again at a later stage by imposing the constraints. Nevertheless, it is hoped that 
despite those computational complications, there will be an improvement of clarity in particular concerning the 
physical meaning of the constraints of the theory. Indeed, it is well known that the primary and secondary 
first class constraints arising in generally covariant theories are directly related to diffeomorphism invariance. 
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It turns out that those constraints can be directly inferred by a straightforward analysis of the corresponding 
Noether currents (that is, the stress-energy tensor). The explicit form of the constraints and their action as 
generators of coordinate transformations has been given in [5] for first order theories, and similar relations are 
easily derived for second order theories, following along the same lines. On the other hand, if we work with 
an effective, not explicitely covariant Lagrangian, the relation between constraints and generators of coordinate 
transformations is more obscure and explicit calculations have to be performed in order to determine the action 
of the constraints on the fields. For instance, simple relations, like the symmetry properties of ([3]), are not valid 
anymore. There will be, of course, alternative relations expressing the (hidden) coordinate invariance, but those 
will not emerge directly from Noether's theorem and have to be obtained more or less by guesswork. 

A second, related, issue concerns the occurrence of surface terms in the Hamiltonian. In view of the specific 
asymptotic behavior of the metric, e.g., goo — 1 — m/r (for asymptotically flat spacetimes), surface integrals 
occurring in General Relativity do not always vanish, in contrast to conventional field theories. In fact, it is 
not hard to show from ([3]) that the only field that explicitly contributes to the integrated field momentum 
is the gravitational field. This raises problems when the effective, first order Lagrangian is used in General 
Relativity, since the omission of four divergences in the Lagrangian ultimately leads to a modification of H by 
surface terms, see, e.g., (1351) . Initially, in the context of canonical quantum gravity, certain surface terms in 
the Hamiltonian where simply ignored, since they are dynamically irrelevant [5]. Later [B], it was recognized 
that by the omission of those surface terms, we actually omit the complete field energy of the system (such 
that the resulting Hamiltonian vanishes weakly) and it was argued, based on comparison with the linearized 
theory, that those terms should not be omitted (except for closed spaces). This was confirmed in [7], where 
it was shown that without those terms, the Hamiltonian formulation of the theory is classically inconsistent, 
because the variations STi./6(p and STi/Sir cannot be properly defined if those terms are missing, and thus, we 
cannot write down the Hamiltonian equations of motion. For a discussion of the treatment of surface terms in 
the variational principle of field theory, see also [8] and [9] . 

Obviously, in view of this situation, it seems promising to start directly from the full Lagrangian ^/—g R, 
instead of omitting first a four-divergence, and then eventually reintroduce it again (in the form of a three- 
divergence) into the Hamiltonian in order to get a consistent theory. With the use of the second order Hamil- 
tonian formulation, it should be possible to proceed strictly canonically, without ever being in the need to 
omit or add a surface term. The resulting Hamiltonian can be interpreted directly as field energy and should 
generate the time evolution of the system, provided we are able to deal consistently with all the constraints. In 
asymptotically flat spacetimes, it does not vanish weakly. We will outline this procedure in section [SI 

Since it might not be obvious that the construction of the Dirac brackets in two steps (namely, first elimi- 
nating the constraints l|29p and then the constraints (|30p ) leads indeed to the same result than the construction 
following directly the procedure of Dirac, we close this section by giving a justification of this procedure for an 
arbitrary theory. 

Suppose we have second class constraints where it is irrelevant whether the labels i,a run over a 

finite set (index) or an infinite set (like the argument x). Let the Poisson brackets be given by 

with invertible, antisymmetric C"'^, Z?"'', the inverse being denoted by Cik and D^p respectively. Nothing is 
assumed for [5'% They may or may not vanish. 

Let B be any expression of the canonical variables (fields (or coordinates) and momenta). Then, in a first 
step, we define 

[A,B]* ^[A,B]-[A,m']C,u[^\B], 
from which we easily find [A, v]/™]* = [ij/'"^ B]* = for any of the ^''"'s and for arbitrary A, B. 
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In a second step, define 

[A, B]** = [A, B]* - [A, B 



Obviously, we have [A, $'']** = [$'>', B]** = for any of the $"'s and for arbitrary A,B. But since we also 
have [^,*']* = = for any A,B (and thus, in particular, e.g., [^r^^"]* = 0), we find that trivially 

[A, ^i]** = [-jr*^ B]** = for any A, B. 

Moreover, from the above construction, it is also clear that for those A, B that commute with all of the 
constraints (i.e., [A, ^''] = [A, = 0, and similar for B), we have [A, B]** = [A, B]. 

Summarizing, the bracket [ , ]** has the following properties: (1) S*^]** = for all of the second class 
constraints S*^ = (^f^^") and for arbitrary A. (2) If A,B commute with all of the constraints ([A, S''^] = 
[B, S^] = 0), then [A, B]** = [A, B]. But those are exactly the properties that define the Dirac bracket. 

In other words, the bracket [ , ]** has exactly the same properties as the bracket 



where E'^^ = [S^, !B^], and Emk the inverse of E'^^ . But since the bracket defined by the above properties 
(1) and (2) is unique, wc must have [A. = [A, B]** . There is thus no need to show this explicitely. 
This justifies the two step construction of the Dirac brackets. 

4 Original Ostrogradski formulation 

We started our investigation from the explicit expression of the canonical stress-energy tensor. From its struc- 
ture, it was most natural to introduce and — V.m as independent variables and to base the canonical 
Hamiltonian formalism on that. On the other hand, the canonical stress-energy tensor is not the only con- 
served current available. As is well known, arbitrary relocalization terms can be added to t\ without changing 
the relation t\ ^ = 0. In special relativistic theories, not even the integrated momentum is changed by such 
a procedure. (Gravity, however, provides an exception to this, because of the previously mentioned different 
asymptotic behavior.) In view of those ambiguities concerning in particular the energy density, it is not really 
surprising that there may also exist several Hamiltonians for one and the same theory. 

There is a simple way to get at least to two of such Hamiltonian descriptions. As is well known, in first order 
theory, the momentum can be derived directly from variation of the action functional as 5S/5ip = dC/dip = -k. 
Here, the notation 5 A denotes a three dimensional variation, i.e., if we find for the variation of a functional 
A{^. n) that 5A = J(ai6ip + aiS7r)d^x, then by definition, ni = SA/Sip and 02 = SA/Stt. It turns out that 
in second order theories, a similar variation of the action does not lead to a unique definition of the canonical 
momenta. Indeed, we have 



Using the field equations d£/dip — {d£/dip^i)^i — {d£/d(p^i^k),i,k in the first term, then performing several 
partial integrations, where three divergences can be omitted (surface terms), while the time integration over 
time derivatives can be carried out explicitely, one readily finds 



[A,B]* = [A,B] - [A,S^]Emk[S^,B] 




(37) 




(38) 
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which leads exactly to our previously adapted choice of fields and momenta 



SS_ _ dC dC 

Sip dip dip,i ' 



6S 



dC 

dip.i 



P ■ 



(39) 
(40) 



On the other hand, one can also do with less variables. Writing the integrand of the second contribution in 
(j38p in the form {dC/dip_i Sip)^i — {dC/dip^i)^i5ip, then omitting in the first term of this expression the three 
divergence and carrying out the time differentiation, we find 



6S 



dC , dC 



dC 



dp dip 



dip^^. 



Sip + ^^^'P ) 



which leads to 



SS_ 

Sip 

Sip 



dC dC dC 

d^~^^Wj''~^W'' 

d£ ^ . 
dip ^' 



(41) 

(42) 
(43) 



that is, to a theory with variables ((/?, iIj — ip) and corresponding momenta (7r,p). Writing down the Hamiltonian 
for this theory, that is, 

n= [{nip + piP- £)d^x, (44) 



and comparing it with our initial Hamiltonian Ti 
by a surface term 

dC 



J{TTip- 



Vm — C)d'^x, we find that the difference is given 



H 



-ip dcTp 



(45) 



Inasfar surface terms are assumed to vanish (as has been assumed during the derivation of ([55)1 and (|¥T|l ). both 
expressions are equal. This does still not mean that the corresponding Hamiltonian theories are also equivalent. 

The formulation based on treating the velocities as independent fields is known as Ostrogradski formulation, 
see, e.g, [10] and [TT] and in particular [12 , where the formalism has been adapted from the case of a finite 
number of variables to the case of field theory. (The point is that in the finite case, their is no such thing as a 
spatial derivative, and the distinction between the above presented formulations cannot be done anyway.) 

Let us briefly verify the consistency of the formulation based on Ti. for the three examples we previously 
dealt with in the alternative formulation. It seems obvious that for the cases of the Lagrangian (llOp (example 
I), as well as for the constrained theory (I^S)) (example III), both formulations are trivially equivalent, because 
of the absence of mixed derivatives </?,p,o- Indeed, we have for those cases p = p and n — while the variables 
4>fi and pf^ could be eliminated without any changes of the Poisson brackets between the remaining variables. 

Only the case (|20| (example II) deserves closer examination. We find from p2|) and (|43)) the momenta 
p ~ ip = and TT — tjj — 2Aip — (ip) — ip — 2A'ip — p. The system is thus free of constraints. The Hamiltonian 
(l44l) takes the form 



n 



1 



1 



1 



1 



ip.^^,,ip'>''nd^x, 



(46) 
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where the velocities have been eliminated with (p = 'tp and ip = p. It is needless to say that the only non- vanishing 
fundamental Poisson brackets in the present formalism are assumed to be 

[n,^]^-S, [p,^P]^~S. (47) 

Note, by the way, that if we compare with the momenta of the alternative formulation of section[^ we have p — p 
and TT — n — Aip. Therefore, from the above Poisson bracket, we can directly derive the relation [p, tt] = —AS 
that arose in the other formulation upon defining the Dirac brackets (see ([Ml))- 
Next, from (|46)) . we find 

[n, ip] = ~iP = -if, [H, ^]^~p= -V-, (48) 

as well as 

[k,^]= Alp- AAlp = [H, p] = -VJ = -p, (49) 

where the field equations have been used in the first relation. Thus, the Hamiltonian H generates indeed the 
time evolution of the phase space variables if, ip, tt and p. 

To conclude, we see that both the initial Ostrogradski formulation, as well as our modified formulation, 
lead to consistent Hamiltonian theories for the simple models analyzed here. Numerically, the corresponding 
Hamiltonians differ by a surface integral. 

As outlined at the end of the previous section, we expect certain improvements of clarity by the use of our 
modified formulation. Although the Hamiltonian formulation necessarily induces a 3+1 split of spacctime, it 
seems nevertheless in the spirit of a covariant theory to treat ip ^^ and (p in a, symmetric way, at least initially. 
One advantage of such a procedure has already be encountered: The resulting Hamiltonian is directly given by 
the integrated time component of the canonical stress-energy tensor, i.e., by the time component of the four- 
momentum Vi = J T^dak- On the other hand, the Ostrogradski Hamiltonian is not a component of anything 
(it corresponds rather to a generalized Legendre transform of the Lagrangian), and its conservation as well 
as its identification with the energy have to be established separately. A direct relation to the stress-energy 
tensor, and thus to the Noether current corresponding to the translational invariance of the theory, should 
turn out to be profitable in generally covariant theories, where we will have to deal with constraints related to 
diffeomorphism invariance (see [2]). 

5 General Relativity 

We start from the Lagrangian 

C = V^R, (50) 

where for simplicity, we omit the factor — i which is necessary to get the correct sign for the energy. The field 
variables, in the second order formalism, are gik and ipikm = gik,m- Further, we find from ([5|) 

ik a/ ^ r -pi ^Ifn kO -p/c ^,lm iO , pi ^10 ^km , pfc ^/O^iml 

= —^[~>-lm9 9 -^lni9 9 +''lm9 9 + ^ l-m9 9 \, l&l) 

pikm ^ VZl [giOgkm _^ gkOgtm 2g''' g^""] . (52) 

The Poisson brackets are assumed to be 

^Irni cik r r^/. ^Irnri ci/e cr 



tt'"] = C<5, [V'.fc,, tt'"-] = C5,^<5, (53) 
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where we use the famihar notation = ^{S^S'^ + S^^S^). 

We use again the simphfied notation p*'^ = and tpik — V'ifco = 9ik,o- Obviously, the relations fSTj) 
and ()52|) are all constraints, since the momenta are expressed in terms of ipikm and gik (and not in terms of 
velocities). In addition, we have the constraints ipikp, — gik, p. (recall that greek indices run from 1 to 3). As a 
result, we have a total of 80 constraints, and at first sight, most of them seem to be second class. Surprisingly 
enough, with a little bit of patience, the above system can be handled without major difficulties. 

Similar as in example III, we divide the constraints into two groups, with the first group consisting of 

^ikii = i^ikfj. - gik,fj. (54) 



^ik^ ^ ptkf. _ V_9 y^Ogk^. _^ gkOgi^ _ 2gi''g0t^] , (55) 

while the second group contains the remaining constraints, 

^ik ik a/ ^ r -pi ^Im kO -p/c ^Irii AO i pi ^10 ^km , pfc ^10 Ami fr.a\ 
=7^ — [-^ Inid 9 " i lm9 9 +''lTn9 9 + ^ lm,9 9 \ l&b) 

= - ^ [g'Og'^O _ g^l^gOO] . (57) 

We begin with the first group. Those constraints satisfy 

[^^k^,^'"n ^ StS-^S, (58) 

as well as [^ik^, ^im,^] = [^'''^^, 'if'-™-"] = 0. This is very similar to our previous examples, and the corresponding 
Dirac bracket reads 

[A, B]* = [A, B] + [A, <I>,;fo„][**'=™, B] - [A, vl/*fe™][$,fe„„ B], (59) 

as is easily verified. (One has to check that the Dirac brackets between any quantity and any of the (above) 
constraints vanishes.) We can now eliminate the variables ipiki.i and p**^'' by imposing the constraints as strong 
relations. It is not hard to verify from (j59p that for the remaining variables, we have again that the Dirac 
bracket is identical to the initial Poisson bracket. Indeed, we have [A, gik]* = [A, gik], [Ajipik]* = [Ajtpik] for 
any A etc., see the corresponding relations (fT5|) of section[2l In particular, we can also check that [7r*'^,p'™]* — 0, 
just like in examples I and III. 

To conclude, the 60 constraints (j54|) and ([55|) can be imposed strongly (that is, ipikfi andp**^'' are eliminated), 
and the remaining brackets remain unchanged. As in example III, we will denote them again by [A, B], without 
star. 

We are thus left with the 20 constraints (|56p and (|57)) . At this stage, the phase space variables are gik, ipik,'^^'^ 
and p*'^'. Most constraints turn out to be second class (note that some components of r\i depend on ipik = gik,o 
and have non- vanishing brackets with p^^). The explicit calculations are quite long, and we will only present 
partial results here. First, we notice that we have 

j^ifc^^imj^Q^ (60) 

Further, we can derive the relations. 

In particular therefore, is first class. Let us write the remaining brackets in the form 
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There are further non- vanishing brackets [$'''^,$'^*] and [^^^j^^*^], for which we do not introduce specific sym- 
bols. Note, however, that are not first class. It is relatively easy to show that G^^^'^ is given by the following 
expression 



1 



(63) 



where the right hand side has still to be symmetrized with respect to fiiy as well as with respect to SX. At this 
point, it is convenient to introduce the Arnowitt-Deser-Misner parameterization of the metric, that is, we write 
500 = N"^ - gt^iuN^^N", go^ = -Nf_, and = -fl/^i/, where N^' = g^^^N^, with g^"' defined as inverse of g^^. We 
can now write 



1 



1 



~gX.~g6^ 



(64) 



Quite interestingly, this is (up to a factor N^^) the same metric that appears in the Laplace-Beltrami type 
term of the Wheeler-DeWitt equation (the so-called superspace metric), see [5]. Let us also define the inverse 
metric 

N ^ ^ 

Gxsnv = —7= [gsngxv + gsvgxn — gsxgnv] , (65) 



satisfying 



X5 ■ 



(66) 

As to the other brackets in ((62)) . we will not derive them explicitely here. In fact, H^'^^^ does not simplify a lot. 
Note that H^'^^^ is antisymmetric with respect to the exchange of the pairs of indices fiv and \5 (in constrast 
to G^'^^^ ^ which is symmetric). 

Having already identified G^^xs as metric, we define H^i,\s as 



HuuXS — GnuapH°''^'^PG, 



(67) 



We introduce the following Dirac bracket 



[A,B]* ^ [A,B] + [A,*^''] G^^xs - [A,*^"^] G^^xs [<^^\B] - [A,^'^''] H^.xs 



(68) 



As always, integration over the arguments of the constraints is understood. It is not hard to verify that we have 
A]* = [^f^". A]* for any A. In particular we now have [^f^", = 0. We can also verify the relation 
[<I>°', $°''']* = 0. As a result, 'I'^* are now first class constraints. (This means that the group of constraints (|56p 
and (j57p in fact contained 4 first class constraints, but we did not recognize them prior to the introduction of 
the Dirac brackets, because we did not consider the correct combination of the constraints.) 

All the second class constraints have now been eliminated, and the remaining set of variables can be chosen to 



be {g^i,, ipfiv, goi, , "001 , P ) • Note that we have chosen ip^^, instead of tt^", because it is easier to eliminate tt'^'^ 
than ipfj^i, (see (1561) . This is, however, merely a matter of convenience, and once we have explicitely evaluated the 
Dirac brackets between all the variables, we can easily reintroduce n^'^ at any stage. We will not perform 
this task completely here, but a few relations will be given below. 
The canonical Hamiltonian is constructed from 



n = 



ik ■ 

TT gik 



ikra / 
- P Vikm 



(69) 
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which has to be expressed in terms of [g^v, 111^,901, tpoi)- Taking into account the first class constraints 
and 5"°', we find for the total Hamiltonian 

nT = n + J (A, + fi, d^x. (70) 

From this Hamiltonian, properly expressed in terms of the independent variables, together with the Dirac 
brackets we can check for eventual secondary constraints. Note that — It turns out that p'^^ 

commutates with the Hamiltonian and does not generate secondary constraints. On the other hand, 
will generate four secondary constraints, the so-called Hamiltonian constraints. The explicit calculations are 
straightforward, but rather lengthy, and will not be carried out here. 

Nevertheless, we will give one Dirac bracket explicitely, in order to compare with the corresponding result 
of the conventional first order approach. As is easily shown, we have [^^'^,tp\s] = as well as ['^'^'^,g\s] = 0. 
Further, we find ['i'^" ,ip\s] = ~^\s^ ^^'^ [^^'^tQxs] = ^^xs^- From those relations, we can evaluate the Dirac 
bracket 

[lpal3,gtj.v]* = Ga0fj.u S. (71) 

We recall that the Dirac bracket is the starting point for the transition to the second quantized theory (see 
[4]). Therefore this relation (which becomes ultimately the commutator between and 5^^) has to be valid 
independently of the specific choice of variables. Thus, the same relation should hold in the conventional first 
order approach, if only g^^ is expressed in terms of the corresponding phase space variables. 
Indeed, in the first order approach, we have [B] 

n^^'^^^^jiK^-' -r^K), (72) 

where the subscript (1) refers to the choice of variables in the first order approach. K^^, is defined as 

= In-\N^,, + N,^^ - ~g^,). (73) 

This can be inverted to 

9t,u = 2-^ (^TT^f^ - ^nj^^g^sg"'^^ gai^gp,^ + ■■■ (74) 

where the dots indicate that there are additional terms, that do not depend on the momenta T^"iy The Poisson 

brackets in the first order theory are assumed to be [T^"fy gfj.}^](i) = — <5"f?^. It is now an easy task to derive the 
symbolic relation 

[gaf3,gfj.iy](l) ^ Gal3,i,^S, (75) 

which holds if gap is expressed properly in terms of T^"fy (Note that 5^,^ = 5^,^ in the signature convention of 
[6], and = — 5^jy in our convention, but this difference is obviously not relevant for the final relation.) We 
conclude that both approaches ultimately lead to the same commutator between g^i, and (7^^. This provides 
strong evidence that the elimination of the second class constraints has been done consistently. 

6 Discussion 

In the previous section, we have treated General Relativity as a constrained second order field theory. We could 
successfully eliminate the second class constraints and the resulting theory is quite similar to the conventional 
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first order approach, containing 8 primary first class constraints, 4 of which are trivial and are expected not 
to lead to secondary constraints, while the remaining 4 are expected to lead to the so-called Hamiltonian 
constraints, similar as in the conventional approach. 

What we have gained at this point is simply the fact that the Hamiltonian equals by construction the 
canonical field energy, including all eventual surface terms. It is important that, in order to achieve this, it is 
not only necessary to use a second order formalism, but rather to use our specific, modified formalism, since, 
as we have outlined in section [4l the conventional Ostrogradski Hamiltonian differs by a surface term from the 
canonical energy. 

The fact, however, that the second order formulation allows us to start from the manifestly covariant 
Lagrangian \J—g R leads to further simplifications. As outlined in [2], the primary as well as the secondary 
first class constraints related to diffeomorphism invariance can be found by inspection of the Noether currents. 
Indeed, as a consequence of Noether's theorem, for a generally covariant second order theory, four relations can 
be derived merely from invariance under coordinate transformations — > + , see [1] . They are obtained 
by a successive localization of the coordinate translations 0. The first, obtained from ^* = = const, is the 
conservation of t\ in the form i.e., t\ — 0. The second, obtained from — e^^.x'"' with constant e\., 



allows for t\ to be written in the form The third and fourth, from — e' 



and — e' 



respectively, lead to the mentioned symmetry properties of the brackets in the expression ([3]). 

Those relations, when integrated over a spacelike hypersurface, lead directly to the first class constraints 
that arise as a result of the same symmetries. Let us start with the last one, i.e.. 



dC 



m,l 



dcTfc. 



As a result of diffeomorphism invariance, the totally symmetric part in mlk of the integrand vanishes (see [2] 
for details). If we choose again = 5^ for the normal vector to the hypersurface, then we must have 



dC 



d^x = 0, 



or simply (omitting the factor 1/2 and the integration for simplicity) 



(76) 



(77) 



(Recall that = p° in our notation, see ^ and ([5])-) In particular, for a symmetric tensor field, we have 
{<^9im)\ = "^iS^gii + Sigirn), and we find 



P (CTff/m) I = 4p gi„i = 0, 



(78) 



where we recall our additional convention p'^ — p and thus, for the tensor case, p'*^" = p**^. As expected, this is 
equivalent to the primary first class constraint ^f'^' = = 0, see ((57)) . 

Similarly, from the antisymmetry in kl of the first bracket in ([3]), we find, integrating over dak and choosing 

nk = SI 

dC I dC dC 



Written in terms of momenta, we find 



d^x = 



(79) 



W,* - \{^+P'",k)i'^V)\+Pm[{<TV)^],m = 



(80) 
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For the metric theory, this can be written in the form 

1 ,^ 



Oi 



:P ghn,kg 



(81) 



If we ehminate p'*^™ in terms of the dynamical variables (i.e, if we impose the second class constraints (|54p 
and ([55]) as well as and '5^'' from (l56|) and (l57|) ). we find that the above relations are identical to the first 
class constraints — from ([56|) . Note, however, that the constraints in the form ([78|) and ([8T|) will arise in 
any covariant second order tensor theory, irrespective of the specific Lagrangian and of the eventual presence of 
additional (second class) constraints. 

Thus, we have recovered the primary first class constraints directly from the Noether relations obtained in 
[2] from the successive localization of the coordinate translations — > + Ci^)- 

Finally, the secondary first class constraints can be obtained from the fact that t\ can be expressed both in 
the canonical form ([2]) and in the form ([3]). In other words, for the canonical field momentum, we can write 



V^ = 



dC 



1 dC 



2 dip, 



dL 



dC 



d^x. 



(82) 



The expression in the second line is a surface term, as can be shown with the help of the symmetry properties 
of the brackets under the integral (see [2 ) . In order to express it in terms of the canonical momenta tt and 
(that is, to ehminate the terms containing tt^'^^ and p^^^^ appearing in ([5^ ). it is actually preferable to use first 
the symmetry properties and then integrate (that is, change first the order of the indices kl in the first term of 
^ and then integrate, and similar for the second term). This is also necessary to find the generators of the 
coordinate transformations, see [2]. Again, the relation (|82p is valid in any generally covariant second order 
theory. 

In any case, we see that the secondary constraints express the fact that the canonical field momentum Vi, 
and thus in particular the Hamiltonian T-L = Vq^ is equal to a surface term. Just as in the conventional first 
order approach, the Hamiltonian vanishes weakly up to a surface term. While in the first order approach, this 
has to be checked explicitely, in our approach, it can be anticipated right from the start, since we work with an 
explicitely covariant Lagrangian, and can therefore make use of the full power of Noether's theorem. 

As a result, there is no need to explicitely evaluate the Hamiltonian in the form (|69p . because it will turn 
out to be weakly equal to the above surface term. All we have to do is to properly express (j82p in terms of 
the dynamical variables, and to construct the Hamiltonian which will thus consist of a surface term and of the 
primary and secondary first class constraints derived previously. 

The action of the constraints on the variables and their relation to the generators of spacetime translations 
are easily established following along the same lines as in [2j , where the corresponding analysis has been carried 
out for first order theories. In contrast to our previous manipulations, i.e., the elimination of the second class 
constraints and the introduction of the Dirac brackets, which rely heavily on the Hilbert-Einstein Lagrangian, 
the discussion concerning the first class constraints and the generators of translations can be carried out in a 
general form and the results hold for any generally covariant second order field theory. 



7 Conclusions 



We have presented an alternative Hamiltonian formulation for field theories based on Lagrangians that contain 
second derivatives. This formulation differs from the conventional Ostrogradski formalism in that all four partial 
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derivatives of the field are considered as independent phase space variables, while in the Ostrogradski method, 
only the time derivative is considered in that way. In our formulation, the Hamiltonian is by construction 
equal to the canonical field energy, and will differ, in most cases, from the Ostrogradski Hamiltonian by a 
surface term. It turns out that the additional variables lead to second class constraints and can easily be 
eliminated with the help of the Dirac bracket. The formalism was applied successfully to several constrained 
and unconstrained second order scalar field theories and its equivalence (up to surface terms) to the Ostrogradski 
formulation was established. Finally, the full power of our formulation has been demonstrated by applying it 
to General Relativity. While conventionally. General Relativity is treated as first order theory, which leads 
to difficulties concerning certain surface terms that are omitted in the Lagrangian, but have to be reinserted 
into the Hamiltonian for consistency, the second order formalism allows us to work directly with the explicitly 
covariant Lagrangian. This way, we avoid not only the above problems concerning the surface terms, but 
moreover, the expressions for the primary and secondary first class constraints as well as their action on the 
field variables and their relation to the generators of coordinate transformations can be directly established 
from the general structure of the Noether currents. 
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